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The LISP Machine Macro- instruct ion Set. 

This document describes the LISP Machine's interpreted order code, 
referred to belou as *" macrocode." The macrocode is designed to be highly 
bit-efficient, and ueil-suited to LISP. The compilation of LISP into macrocode 
is very straightforuard, as ui 1 1 be shoun in examples belou. 

Macrocode is created by the "macrocompi ler," uhich is called "QCMP." 
It is never neccesary to urite out macrocode manually, because it correspondes 
so closely with the LISP source code that it uould be easy to urite the same thing 
in LISP. 

. Uhen QCMP is run on a function, it is said to "macrocompi le" the function* 
It produces one "Function Entry Frame," referred to belou as a FEF, for each 
function compiled. The resulting LISP pointer has datatype DTP-FEF-PO INTER, 
and points to the FEF, uhich is a block of memory at least 8 uords long. 

The FEF has several sections. The first section is aluays 7 uords long, 
and contains various information about the format of the FEF and information 
about hou the function should be Invoked. For complete details of the bit 
layouts of these uords. see the FORMAT document. 

The next section of the FEF contains pointers to the VALUE cells and 
FUNCTION cells of symbols. These pointers are of datatype DTP-EXTERNAL-VALUE -CELL-PO I NTE! 
and are used as an "exit vector", that Is, compiled code can refer to these pointers 
in order to access special variables. 

The next, section contains the Argument Description List (ADD. 
The ADL contains one entry for each argument uhich the function expects to be 
passed, and contains all relevant information about the argument: uhether it is 
required, optional, or rest, hou to initialize it if It is not provided, 

whether it is local or special, datatype checking information, and so on. i 

Sometimes the ADL can be dispensed uith If the "fast argument option" can | 

be used instead; this helps save time and memory for small, simple functions. 
The Jutaili; can be found in the FORMAT document. 

The next section of the ADL contains various constants uhich the 
function might uant to refer to: if the function includes (FOO '(A B)), 
then the list (A B) uould be put in the constants area so that the microcode 
can refer to i t. 

The rest of the FEF is the actual macro! nstruct ions themselves. 
Each macro instruct ion is IB bits long, and so two macroinstructions 
are stored in each uord of the LISP machine. There are four conceptual 
"classes" of microinstructions, each of uhich is broken doun into fields 
in a different uay. 

CLASS I: 



3 I 4 i 3 i B 
DEBT. OPCODE REGISTER OFFSET 



There are nine class I instructions, designated by 8 through 18 (octal) 
in the OPCODE field. Each instruction has a source, uhose address 
is computed from the "REGISTER" and OFFSET fields, and a destination 
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given by the DESTINATION field. The instructions are: 
OPCODE NAHE 







CALL 



CALL0 



Open a call bioc)^ on the stacl^, to call the function 

specified by the address. Whatever the function returns 

Mill go to the destination. The actual transfer of control 

will not happen until the arguments have been stored. 

CSee destinations NEXT and LAST.) 

CALL in the case of a function uith no arguments. 

The transfer <tf control happens immediately. 

Move the contents of E to the destination. 

Put the CAR of the contents of E in the destination. 

Analogous. 

Analogous. 

Analogous. 

Analogous. 

Analogous. 

The effective address, E, is computed from the "register" and the offset. 
The instructions really use addressing relative to some convenient place specified 
by the "register" field. The register may be: 



2 


MOVE 


3 


CAR 


4 


COR 


S 


CADR 


6 


CDDR 


7 


CDAR 


8 


CAAR 



REG 







FUNCTION 



FEF 



FEF+100 
FEF+200 
FEF+300 
CONSTANTS PAGE 



LOCAL BLOCK 
ARG POINTER 
POL 



This is the starting location of the currently-running 

FEF. This is hou the macrocode addresses the pointers 

to value and function cells, and the constants area. 

Same as 0, plus 100 octal. 

Analogous. 

Analogous. 

This is a page of uidely used constants, such as T, NIL, 

small numbers, etc. There is only one constant page in 

machine. They are kept all on one page so that they can 

be shored among all FEFs, so that they wi I I not have 

to be repeated in each FEF*8 constant area. 

This is the address of the local blocl^ on the PDL, 

and is used for addressing local variables. 

This is the argument pointer into the PDL, and is 

used for addressing arguments of the function. 

The offset must be 77. The top of the stack is 

popped off and used as the operand. The other 

possible values of offset are not currently used. 



(See the FORMAT file for hou the PDL frame for each function is divided 

up into header, argument block, local block, and intermediate result stack.) 

Note: The first 4 addressing modes are all provided to allow an effective 

8-bit offset into the FEF. 
Note: The same register-offset scheme is used in the class II instructions. 

An additional complication in computing the "effective address" comes from 
invisible pointers. Once the register and offset have been used to compute- 
an initial effective address E, the word at that location is examined (even 
if this is an instruction Mhich uses E as a destination.) If the data type 
of that uord is "Effective Address Invisible", the pointer field of that word 
is used as the effective address E. This is used, for example, to access 
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value cells of special variables. The FEF "register" Is used, and the location 
of the FEF addressed contains an effective address invisible pointer uhich 
points to the desired value cell. This scheme saves bits in the instruction, 
uithout requiring the use of extra instructions to make special value cells , 
addressable. 

The destination field is someuhat more complicated. First of all, 
before the result is moved to the destination, two "indicators" are set. 
The indicators are each stored as a bit, and correspond to processor status flags 
such as N and Z on the PDP-11. They are called the ATOM indicator, uhich is set If 
the result of the operation is an atom, and the NIL indicator, uhich is set if 
the resuly is NIL. The class III instructions (BRANCH) may look at 
the indicators. 

Note: In actuality, there are not actually any physical indicators. 

Instead, the last result computed is saved in an internal register, and examined 

by the BRANCH instructions. The functional effect is the same. 

The destinations are: 

DEST FUNCTION 

BBSS BBBSCBSB 

8 IGNORE This is the simplest; the result is simply discarded. 

It is still useful, because it sets the flags. 

1 TO STACK This pushes the destination on the stack, uhich is 

useful for passing arguments to Class IV instructions, et 

2 TO NEXT This is actually the same thing as TO STACK, but it 

is used for storing the next argument to the open 
function when it is computed. 

3 TO LAST This is used for storing the last argument of the 

open function. It also pushes the result on the staci^, 
and then it "activates" the open call biock. That is, 
control will be passed to tlie function specified by 
the last CALL instruction, and the value returned by the 
function will be sent to the destination speci fied by the 
destination fieid of the call instruction. 

4 TO RETURN Return the result of the instruction as the value of this 

function, (i.e. return from subroutine.) 

5 TO NEXT, QUOTED This is the same as TO NEXT, except that for error 

checking, the USER-CONTROL bit of the word being 
pushed is set, telling the called function that it 
is getting a quoted argument (if it cares). 
(This is not implemented.) 

8 TO LAST, QUOTED Analogous. 

7 TO NEXT LIST This one is fairly tricky. It is used in conjunction 

with the LIST (Class IV) instruction to efficiently 
perform the lisp "LIST" function. It is documented 
under the LIST instruction. 

Note: B and B (the QUOTED) destinations have not been implemented as of 11/83/7B. 

Note: The same DESTINATION field is used by the class IV instructions. 

CLASS II: 
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I 7 

OPCODE 



I 3 I 
REGISTER 



6 

OFFSET 



The class II Instructions have no destination field; the result of the 
operation (if any) is either pushed on the stacl^ (like a destination TO STACK 
or TO NEXT in a class one instruction) or is stored at the effective address. 
The "register" and offset are used in exactly the same way 

as in the class I instructions, except that the E calculated Is sometitnes used 
as a destination Instead of a source. 

The instructions are broken up into three subgroups by the first three bits, 
of the opcode [in the microcode they are referred to as Non-destination instruction 
groups 1, 2 and 3), and then into the separate instuctions by the next four bits 
as follous: (a "-" in the left hand column means that this instruction pops the stack; 
a "+" means that it pushes something onto the stack.) 

GRP. OPCODE FUNCTION 



1 
1 

1 

1 

1 

1 

1 

1 

12 

12 

12 

12 

12 

12 

12 

12 

13 



13 
13 
13 
13 
13 
13 
13 



8 
1 

2 
3 
4 
5 
B 
7 

1 
2 
3 
4 
S 
6 
7 
8 

1 
2 
3 
4 
5 
6 
7 



/ 

AND 
XOR 
OR 

> 

< 

EG 

SCOR 

SCDDR 

1+ 

1- 

BIND 

BINDNIL 

BINDPOP 

SETNIL 

SETZERO 

PUSH-E 

nOVEM 

POP 



Not used. 

Adds C(E) to the top of the stack and replaces 
the result on the stack. 
Subtracts C(E) from the top of the stack. 
Analogous. 
Analogous. 
Analogous. 
Analogous. 
Analogous. 

\ These compare C{E) to the 
I -condition being tested is 
otherwise it is set. The stack 



top of the stack, and if the 
true, the NIL indicator is cl 
is popped. 



/ 

Get the COR of CCE), and store it in E. 

Analogous. 

Analogous. 

Analogous. 

The cell at E is bound to itself. The linear binding 

pdl is used. 

The eel I at E 

The eel I at E 

Store NIL in 

Store fixnum 



IS 

is 
E. 
in 



bound 
bound 



to 
to 



NIL. 

a value popped off 



the stack. 



E. 



Push a locative pointer to E on the stack. 
Move the data on top of the stack to E. 
Pop the top of the stack into E. 



CLASS III 



BRANCH 
CODE 



4 
(14) 



3 I 

OFFSET 



The class III instruction is for branching. There is a 3 bit Branch 
Code field which determines whether the branch happens, and sometimes 
what to do if it fails. It is decoded as follows: 



BRANCH 



Uednesday. February 21, 1979 81:85:39 AhLMDOC; MACRO 28 Page 1.4 

CODE FUNCTION 

ALWAYS Always branch. 

1 MILIND Branch if the NIL indicator is set, else drop through. 

2 NOT NILIND Branch if the NIL indicator is not set, else drop through 

3 NILIND ELSE POP \ These two are the same as NILINO and NOT NILIND, 

I except that if the condition fails, 

4 NOT NILIND ELSE POP/ the stacic is popped. 

5 ATOniND Branch if the ATOM indicator is set, else drop through. 
B NOT ATOniND Analogous. 

If the decision is made to perform the branch the offset, considered as a signed 

(tuo*s complement) offset is added to the PC (i. e. a relative branch, such 

as is used by the PDP-11). If the offset is 777, however, it is interpreted as _ 

meaning that this is a long-distance branch, and the real offset is obtained from the 

next (IB-bit) halfword. The PC added to is always the incremented PC; i.e. the 

address of the instruction +1 in the short case, and the address of the 

instruction +2 in the long case. 

CLASS IV 



3 I 4 I 9 

DEST. (15) OPCODE 



The class IV (miscellaneous) instructions take their arguments on the 
staci^, and have a destination field which works the same way as the Class I 
instructions. They all have 15 (octal) in the opcode bits, and the actual 
opcode is in the last 9 bits. Thus there can be up to 512. of them. 
Most of them may be called directly from interpretive LISP, and so 
some of them duplicate functions available in classes I and II. 
(Note on implementation: since there are far too many class IV instructions 
to dispatch on using the dispatch memory of the CONS machine, the starting 
locations of the routines are kept in main memory. The location of the 
base of the dispatch table is kept in A-V-M ISC-BASE at all times.] 

Since most of these functions are callable from interpretive level ' 
(the normal LISP user sees them), they form part of the nuclear system, and 
so are documented in the Lisp Machine Nuclear System document (LMNUC >) . 



The first 280 (octal) Class lY operations are not in the dispatch 
table in main memory, but are specially checked for. These are the "LIST 
instructions, which work in cooperation with the NEXT-LIST destination. 

Operations 8-77 are called LIST 8 through LIST 77. The list 
<N> instruction allocates N Q*s in the default consing area IA-CNSADF3 
which is initialized to a CDR-NEXT, CDR-NIL style list of NILs. Then 
the instruction pushes three words on the stack; 

1) A pointer to the newly allocated block, 

2) The destination foeld of the LIST instruction, 

3) A pointer to the newly allocated block (another copy). 

Note that the destination, as in the CALL instruction, is not used 
instantly; it is saved and used later. 

After the LIST instruction has been performed, further 
instructions can store to destination NEXT-LIST; once the macro-code 
computes the next arg of what was the LIST function in the source code 



n 



Wednesday. February 21. 1979 01:85:39 AI jinDOCiHACRO 28 Page 1.5 

It stores it to NEXT LIST. What destination NEXT LIST does is: the word on 

the top of the stack is taken to be a pointer to the next allocated 

cell. The pointer is popped, the result of the instruction is stored 

uhere it points, and then the CDR of the pointer is pushed back on to the stack. 

if the CDR was NIL houever, then we must be finished with the LIST 

operation, so the NIL is popped off the stack and discarded and a pointer 

to the neuly allocated area (another copy of uhich was thoughtfully stored 

on the stack) Is sent to the destination of the LIST <n> instruction (uhich 

uas also stored on the stack), and the tuo remaining Mords which the 

LIST <n> pushed are popped off. 

CLASS V 

Opcodes 16 and 17 (octal) are hot used and reserved for future expansion. 

EXAMPLES: 

Here is a (very) typical Lisp function, for computing factorials. 

(defun fact (x) 

(cond ((zerop x) 1) 

(t (* X (fact (1- x)))))) 

This is the macrocode produced by the compiler, typed out by the DISASSEMBLE 
function on the Lisp Machine. 

22 MOVE D-PDL ARG|8 ;X 

23 MISC D- IGNORE ZEROP 

24 BR-NIL 2B 

25 MOVE D-RETURN '1 

2G MOVE D-PDL ARG|8 ;X 

27 CALL D-PDL FEF|18 ; ©FUNCTION-CELL FACT 

30 MOVE D-PDL ARG|8 ;X 

31 MISC D-LAST 1- 

32 * PDL-POP 

33 MOVE D-RETURN PDL-POP 

The first thing (line 22) is to push argument 8 (x) onto the stack, 
and (line 23) check if it is equal to zero. Line 23 uses the ZEROP miscellaneous 
function, uhich sets the "indicators" to NIL if the quantity was not ZERO. 
Line 24 is a branch instruction which tests the "indicators"; if NIL is set, 
it ui I I branch to 28. If NIL was not set (the number was zero), it falls through 
to line 25, which returns the value 1. 

If the number was not zero (the second clause of the COND in the source), 
then control passes to line 2G, which pushes X on the POL (first argument to 
the multiply on line 32). Next line 27 opens a call to FACT. Line 38 subtracts 1 
from X (with the 1- miscellaneous function), and moves the result to "destination LAST". 
This result is thus the first and only argument to the recursive 
invokation of FACT, the result of which is left on the POL because of the 
destination field of the CALL instruction on line 27. 

Now (FACT (1- x)) and X are on the PDL, and they are multiplied by the 
multiply instruction on line 32. It leaves its result on the PDL, to be found by line 
33, which returns the result. 



